
SEQUENCE LISTING



(1) GENERAL INFORMATION:



(i) APPLICANT: Tripp, Cynthia A. 

Frank, Glenn R. 
Grieve, Robert B. 

(ii) TITLE OF INVENTION: NOVEL PARASITIC HELMINTH 
P22U PROTEINS 

(iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sheridan Ross P.C. 

(B) STREET: 1700 Lincoln St., Suite 3500 

(C) CITY: Denver 
(D) STATE: CO

(E) COUNTRY: U.S.A. 

(F) ZIP: 80203 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/460,428 

(B) FILING DATE: 02-JUN-1995 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Connell, Gary J. 

(B) REGISTRATION NUMBER: 32,020 

(C) REFERENCE/DOCKET NUMBER: 2618-13-3 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 303/863-9700 

(B) TELEFAX: 303/863-0223 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3..911



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



GC GAG TTA AAT AGT CGA ATT TCC GGA GTA CAC CGT AAT ACT GCA GGT
47

Glu Leu Asn Ser Arg Ile Ser Gly Val His Arg Asn Thr Ala Gly
1 5 10 15

GCT TTA CAA CGA TTT GCT CTA AAT GGT CAA AAT ACT CTT AAC GAA GGA 
95 

Ala Leu Gln Arg Phe Ala Leu Asn Gly Gln Asn Thr Leu Asn Glu Gly
20 25 30 

TCA AGT TAT GAG CCA AAC GGA CTA TTT GTA TTT TCA GCA ATA AAC GGT 
143 

Ser Ser Tyr Glu Pro Asn Gly Leu Phe Val Phe Ser Ala Ile Asn Gly
35 40 45 

AGC CAT ACT GAT AGC TTA TCT CAG TAT GGT GAA GGA ATA AAT GAA AAT 
191 

Ser His Thr Asp Ser Leu Ser Gln Tyr Gly Glu Gly Ile Asn Glu Asn
50 55 60 

TAT CAT TCT GGA ACT AAT TAT TAT GAT GAA GTA GAA TTA AGA GAT AAA 
239 

Tyr His Ser Gly Thr Asn Tyr Tyr Asp Glu Val Glu Leu Arg Asp Lys 
65 70 75 

ACA AAT CAG ACA TCG TAC ATT AAT GGA AAT GAT AAT GGA ATC AAT GGA 
287 

Thr Asn Gln Thr Ser Tyr Ile Asn Gly Asn Asp Asn Gly Ile Asn Gly

80 85 90 95 

AAG GAT GAT GAA GAT CTG GAT GAA TGC TCT GAT CAA GAA TTC CGA TGT 
335 

Lys Asp Asp Glu Asp Leu Asp Glu Cys Ser Asp Gln Glu Phe Arg Cys
100 105 110 

CCA TAT CTA GCT AAA ACA CTT TGT GTT CAT TAT TTG AAA ATA TGC GAT 
383 

Pro Tyr Leu Ala Lys Thr Leu Cys Val His Tyr Leu Lys Ile Cys Asp
115 120 125 

GGT ATT GAT GAT TGT GGT GAT GGA AGT GAT GAA ATG AAC TGT GCT GAT 
431 

Gly Ile Asp Asp Cys Gly Asp Gly Ser Asp Glu Met Asn Cys Ala Asp
130 135 140 

GAT GAA GTG ATA ACA TCA ATA AAT GGT AAC GAA TCA ATC AAT ATC AGA 
479 

Asp Glu Val Ile Thr Ser Ile Asn Gly Asn Glu Ser Ile Asn Ile Arg
145 150 155 

TGT GAT CCG GAT CAA TTT CGA TGT GAA AAT GGA AAA TGT ATC GCA CAA 
527 

Cys Asp Pro Asp Gln Phe Arg Cys Glu Asn Gly Lys Cys Ile Ala Gln
160 165 170 175 

ATT GAT CGA TGT AAT CGA AAA TAT GAT TGT GAT GAT GGT ACA GAT GAA 
575 

Ile Asp Arg Cys Asn Arg Lys Tyr Asp Cys Asp Asp Gly Thr Asp Glu
180 185 190 

ACA ACT TGT GAA TAT TTC GTG CAA GCT TTG CAA CAA GCG AGA GGT GTA 
623 
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu Leu Asn Ser Arg Ile Ser Gly Val His Arg Asn Thr Ala Gly Ala
1 5 10 15

Leu Gln Arg Phe Ala Leu Asn Gly Gln Asn Thr Leu Asn Glu Gly Ser
20 25 30

Ser Tyr Glu Pro Asn Gly Leu Phe Val Phe Ser Ala Ile Asn Gly Ser
35 40 45

His Thr Asp Ser Leu Ser Gln Tyr Gly Glu Gly Ile Asn Glu Asn Tyr
50 55 60

His Ser Gly Thr Asn Tyr Tyr Asp Glu Val Glu Leu Arg Asp Lys Thr
65 70 75 80

Asn Gln Thr Ser Tyr Ile Asn Gly Asn Asp Asn Gly Ile Asn Gly Lys
85 90 95

Asp Asp Glu Asp Leu Asp Glu Cys Ser Asp Gln Glu Phe Arg Cys Pro
100 105 110

Tyr Leu Ala Lys Thr Leu Cys Val His Tyr Leu Lys Ile Cys Asp Gly
115 120 125

Ile Asp Asp Cys Gly Asp Gly Ser Asp Glu Met Asn Cys Ala Asp Asp
130 135 140

Glu Val Ile Thr Ser Ile Asn Gly Asn Glu Ser Ile Asn Ile Arg Cys
145 150 155 160

Asp Pro Asp Gln Phe Arg Cys Glu Asn Gly Lys Cys Ile Ala Gln Ile
165 170 175

Asp Arg Cys Asn Arg Lys Tyr Asp Cys Asp Asp Gly Thr Asp Glu Thr
180 185 190

Thr Cys Glu Tyr Phe Val Gln Ala Leu Gln Gln Ala Arg Gly Val Thr
195 200 205

Val Gln Asp Asn Ala Ile Arg Asp Asp Glu Ile Pro Asn Tyr Thr Val
210 215 220

Ser Met Glu Gln Lys Tyr Asp Gln Val Lys Glu Asp Lys Glu Arg Arg
225 230 235 240

Met Gln Glu Glu Glu Glu Gln Glu Arg Leu Arg Glu Tyr Glu Glu Gln
245 250 255

Ile Gln Glu Lys Leu Arg Gln Glu Glu Glu Arg Glu Arg Gln Glu Gln
260 265 270

Glu Arg Arg Gln Lys Glu Arg Glu Arg Met Glu Gln Glu Arg Ile Arg
275 280 285

Gln Glu Tyr Asp Glu Lys Glu Arg Gln Arg Gln Tyr Ala Glu Gln
290 295 300



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1016 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 3..626

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 627..1016



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GT TTT GTT GTA CTA CTC GTT GTT GCA ATA TGG ATT GAA ATG AGC CAA 
47 

Phe Val Val Leu Leu Val Val Ala Ile Trp Ile Glu Met Ser Gln
1 5 10 15

GGC CAA CAA ATG ATC AAA CAA TGT AAA TGT TCT GAT ATT GCA CCA TGT 
95 

Gly Gln Gln Met Ile Lys Gln Cys Lys Cys Ser Asp Ile Ala Pro Cys
20 25 30 

CAA TTA ACT GCC GTT CAA TCA GTT TTA CCA TGT GCT GAT CAA TGC CAG 
143 

Gln Leu Thr Ala Val Gln Ser Val Leu Pro Cys Ala Asp Gln Cys Gln
35 40 45 

AAA TAT ATT ACT TCA ATT GGT GGT AAT TAT GAT CAA ATT AGT AAC TGT 
191

Lys Tyr Ile Thr Ser Ile Gly Gly Asn Tyr Asp Gln Ile Ser Asn Cys
50 55 60 

TTT AAA CAG AAA CAA TCA ATT ATA AAT GAT GCT ATG AAA TGT GCT CAA 
239 

Phe Lys Gln Lys Gln Ser Ile Ile Asn Asp Ala Met Lys Cys Ala Gln
65 70 75 

GAT GCT TTC CCA AAT GCA TGC GCA CAA GGT GAA CCA AAA ATG GTA CCA 
287 

Asp Ala Phe Pro Asn Ala Cys Ala Gln Gly Glu Pro Lys Met Val Pro

80 85 90 95 

AAA CGA TTC GGA AAA GGT CTT CAA TTA GCT GTA ATG ACT GAT ATC AAC 
335 

Lys Arg Phe Gly Lys Gly Leu Gln Leu Ala Val Met Thr Asp Ile Asn
100 105 110 

AAA GAA TTA CAA CGA ATG GGA ATA GCA AAT CAA GTT ACT CAA CTA ATC 
383 

Lys Glu Leu Gln Arg Met Gly Ile Ala Asn Gln Val Thr Gln Leu Ile
115 120 125 

TCC CAA GGT CGA CGA TTC TTT AAA TGC TTC CAA TCG TGT ATG ATG AAA 
431 

Ser Gln Gly Arg Arg Phe Phe Lys Cys Phe Gln Ser Cys Met Met Lys
130 135 140 

AAA TTG GGC TCA TGT TCT CCA GAT TGT GGT CTT GAT TTA CCA TCT GAT 
479 

Lys Leu Gly Ser Cys Ser Pro Asp Cys Gly Leu Asp Leu Pro Ser Asp 
145 150 155 

AAT GTT ATG GTT CAA ACA GTT AAA AAT TGC GCT CAA AAA AGT GGT ATT 
527 

Asn Val Met Val Gln Thr Val Lys Asn Cys Ala Gln Lys Ser Gly Ile

160 165 170 175 

CAA ACT GCA TCG GTG CAA GAT CTT TGC TTT TGC GTC GAA CAA GCT GGT
575

Gln Thr Ala Ser Val Gln Asp Leu Cys Phe Cys Val Glu Gln Ala Gly
180 185 190 



ATT CGG CAA CTT TCT GAT GTA TGT CCT CGT ATA CAA ATT TTC AAA ACG 
623 

Ile Arg Gln Leu Ser Asp Val Cys Pro Arg Ile Gln Ile Phe Lys Thr
195 200 205 

AAA TGAGTATTGA GAATATTGCA CTAGCAGCAA TCATTATTTT TCTCGAGAAT
676

Lys



TTTCGCTATC AATAAGTTGG AATATGATTA CAATAATATA TATATTAACT GCAAAAATCT 
736 



TTCTTCTTCA AAATTATTTT TCATTTCGCT CTCATAATTG CATGATAATA GTCATAATGA 
796 

AAAACAGGTT TTCTTTTTTT AAAATGATAA CTTCAAACAA ATAGGTATTT CTTGATATAT 
856 



ATATGTATGT ATGTATGTGT GTGTGTGTGT GTGTGTGTAT GTGTGTGTTT GTGTATGTGT 
916 

ATATGTATGT ATGTATGTAT GTATGTATGT ATGTGTAGGA GAAAAGCAAA CTAAACAGTA 
976 

AATGAAAGAA AAAAATAAGT CAAATAAAAG TTTGATAATT 
1016 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Phe Val Val Leu Leu Val Val Ala Ile Trp Ile Glu Met Ser Gln Gly
1 5 10 15

Gln Gln Met Ile Lys Gln Cys Lys Cys Ser Asp Ile Ala Pro Cys Gln
20 25 30

Leu Thr Ala Val Gln Ser Val Leu Pro Cys Ala Asp Gln Cys Gln Lys
35 40 45

Tyr Ile Thr Ser Ile Gly Gly Asn Tyr Asp Gln Ile Ser Asn Cys Phe
50 55 60

Lys Gln Lys Gln Ser Ile Ile Asn Asp Ala Met Lys Cys Ala Gln Asp
65 70 75 80

Ala Phe Pro Asn Ala Cys Ala Gln Gly Glu Pro Lys Met Val Pro Lys
85 90 95

Arg Phe Gly Lys Gly Leu Gln Leu Ala Val Met Thr Asp Ile Asn Lys
100 105 110

Glu Leu Gln Arg Met Gly Ile Ala Asn Gln Val Thr Gln Leu Ile Ser
115 120 125

Gln Gly Arg Arg Phe Phe Lys Cys Phe Gln Ser Cys Met Met Lys Lys
130 135 140

Leu Gly Ser Cys Ser Pro Asp Cys Gly Leu Asp Leu Pro Ser Asp Asn
145 150 155 160

Val Met Val Gln Thr Val Lys Asn Cys Ala Gln Lys Ser Gly Ile Gln
165 170 175

Thr Ala Ser Val Gln Asp Leu Cys Phe Cys Val Glu Gln Ala Gly Ile
180 185 190

Arg Gln Leu Ser Asp Val Cys Pro Arg Ile Gln Ile Phe Lys Thr Lys
195 200 205



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Asp Cys Gly Asp Gly Ser Asp Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Gln Asp Ala Phe Pro Asn Ala Cys Ala Gln Gly Glu Pro Lys
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Ile Ala Pro Cys Gln Leu Thr Ala Val Gln Ser Val Leu Pro Cys
1 5 10 15

Ala Asp Gln Cys Gln Lys
20 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Gly Ser Cys Ser Pro Asp Cys Gly Leu Asp Leu Pro Ser Asp Asn 
1 5 10 15

Val Met Val Gln Asp Val
20 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /label= PRIMER 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CGGGATCCCG AGTTAAATAG TCG 
23 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..17

(D) OTHER INFORMATION: /label= PRIMER

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TGCAGGATCC TGCACCG 
17 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..50 

(D) OTHER INFORMATION: /label= PRIMER 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAGAGAGAGA GAGAGAGAGA ACTAGTCTCG AGTTTTTTTT TTTTTTTTTT 
50 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..17

(D) OTHER INFORMATION: /label= PRIMER

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TGYTCNCCNG AYTGYGG 
17 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..17 

(D) OTHER INFORMATION: /label= PRIMER 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TGYAGTCCNG AYTGYGG 
17 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..17 

(D) OTHER INFORMATION: /label= PROBE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TGNACCATNA CRTTRTC 
17 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /label= PROBE 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTGTTTGAAC CATAACATTA CAGATGG 
27 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..25

(D) OTHER INFORMATION: /label= PRIMER

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GTTGCAATAT GGGATCCAAT GAGCC 
25 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..25

(D) OTHER INFORMATION: /label= PRIMER 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGCTAGTGCA GGATCCTCAA TACTC 
25 



